DOCUMENT-IDENTIFIER: US 6405219 B2

TITLE: Method and system for automatically updating the version of a set of files 
stored on content servers 



Abstract Text (1):

A method and system for managing the replication and version synchronization of 
updates to a set of source files on geographically distributed heterogeneous content 
servers with minimal impact on a network's bandwidth. The configuration of each 
content server is either manually entered or automatically determined. The current 
version of the source files is created on at least one source server. A Primary
global server stores a copy of the current version of the set of the source files 
along with the configuration of each content server. The Primary global server 
generates and distributes a particular version change container and version 
distribution list to each remotely located Secondary global server. Each Secondary 
global server employs the version distribution list and the contents of the version 
change container to identify the current version of each source file necessary to 
upgrade the set of source files on each local content server. Each identified source 
file is copied to a sub-directory on each local content server associated with the 
Secondary global server. At each local content server, the renaming of each copied 
source file is employed to update to the current version of the set of source files on 
the content server. A versioned file tree repository for the set of source files 
includes archived objects. When the version distribution list identifies a previous 
version, the current version of source files on the local content servers can be 
rolled back to the previous version. 
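
The copy-then-rename update and the rollback described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the `.staging` directory name, and the flat (non-nested) file layout are all assumptions made for the example, and the versioned file tree repository is reduced to a single archive directory.

```python
import os
from pathlib import Path


def stage(content_root: Path, files: dict[str, bytes]) -> Path:
    """Copy each identified source file into a staging sub-directory."""
    staging = content_root / ".staging"  # hypothetical directory name
    staging.mkdir(exist_ok=True)
    for name, data in files.items():
        (staging / name).write_bytes(data)
    return staging


def activate(content_root: Path, staging: Path, archive: Path) -> None:
    """Make the staged files live by renaming them into place."""
    archive.mkdir(parents=True, exist_ok=True)
    for staged in sorted(staging.iterdir()):
        live = content_root / staged.name
        if live.exists():
            # Keep the previous copy so the update can be rolled back.
            os.replace(live, archive / staged.name)
        # Rename within one filesystem; the slow copy already happened in stage().
        os.replace(staged, live)


def rollback(content_root: Path, archive: Path) -> None:
    """Restore the archived previous version over the current one."""
    for old in sorted(archive.iterdir()):
        os.replace(old, content_root / old.name)
```

Separating the copy from the rename keeps the switch to the new version short, because the transfer finishes before any live file is touched. In this sketch a file that first appears in an update is not removed by `rollback`; a fuller version would track added files as well.
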
US-PAT-NO: 6418452

DOCUMENT-IDENTIFIER: US 6418452 B1

TITLE: Network repository service directory for efficient web crawling
DATE-ISSUED: July 9, 2002
INVENTOR-INFORMATION:

NAME                      CITY       STATE   ZIP CODE   COUNTRY
Kraft; Reiner             Gilroy     CA
Emens; Michael Lawrence   San Jose   CA

US-CL-CURRENT: 707/200; 707/1, 707/10, 707/104.1, 707/201, 707/203, 709/201, 709/203,
709/223, 709/224, 709/225, 709/226



ABSTRACT: 

A master repository service maintains a directory of web servers and the most recent 
times that their web contents were modified, and provides this information to web 
crawlers to increase their efficiency. The master repository service receives web 
content update reports from a plurality of web servers, updates the directory to keep 
it current, and provides crawlers with web site modification information. The web site 
modification information preferably comprises identifiers for new web sites, "dead" 
web sites, and modified web sites. Each crawler is preferably provided only with web 
site modification information received since it last received information from the 
master repository service. The information allows web crawlers to know immediately 
about new web sites, and allows them to spend time visiting only those web sites that 
are new or that have changed their content. 
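
The directory behavior described in this abstract (servers report changes, and each crawler receives only what changed since its last request) can be sketched as below. The class and method names are hypothetical, and a logical counter stands in for real modification times so the example stays deterministic; the patent does not prescribe this design.

```python
from dataclasses import dataclass, field

VALID_STATUSES = {"new", "modified", "dead"}


@dataclass
class MasterRepository:
    # site -> (status, logical time of the most recent report)
    directory: dict[str, tuple[str, int]] = field(default_factory=dict)
    # crawler id -> logical time at which it was last served
    last_served: dict[str, int] = field(default_factory=dict)
    clock: int = 0

    def report(self, site: str, status: str) -> None:
        """Record a content update report sent by a web server."""
        if status not in VALID_STATUSES:
            raise ValueError(f"unknown status: {status}")
        self.clock += 1
        self.directory[site] = (status, self.clock)

    def changes_for(self, crawler_id: str) -> dict[str, str]:
        """Return only the entries reported since this crawler last asked."""
        since = self.last_served.get(crawler_id, 0)
        self.last_served[crawler_id] = self.clock
        return {
            site: status
            for site, (status, reported_at) in self.directory.items()
            if reported_at > since
        }
```

A crawler that calls `changes_for` twice with no reports in between gets an empty result the second time, which is the saving the abstract describes: it has no sites to revisit.
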

15 Claims, 2 Drawing figures 
Exemplary Claim Number: 1 
Number of Drawing Sheets: 2 
DOCUMENT-IDENTIFIER: US 5842216 A

TITLE: System for sending small positive data notification messages over a network to 
indicate that a recipient node should obtain a particular version of a particular data 
item 

DATE-ISSUED: November 24, 1998
INVENTOR-INFORMATION:

NAME                 CITY      STATE   ZIP CODE   COUNTRY
Anderson; David      Belmont   MA
Waters; Richard C.   Concord   MA

US-CL-CURRENT: 707/203; 707/201, 709/203, 717/170



A system is provided for eliminating time-consuming, unnecessary transfers of data over
networks such as the World Wide Web while at the same time guaranteeing timeliness
of the data used by recipients. Timeliness is assured by immediately sending small 
data-notification messages whenever data becomes relevant or changes. Efficiency is 
guaranteed by transmitting the data itself only when requested by the recipient of a 
data-notification message. In particular, recipients are alerted to the presence of, 
and changes in, data they might use by data-notification messages containing a 
timestamp, the data location, and a checksum. Based on the timestamp, the recipient 
can determine whether the data-notification message contains timely information or 
should be ignored. Based on the data location and checksum, the recipient can 
determine whether it already has the current version of the data in question, for 
example stored in a cache. 
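
The recipient-side decision described in this abstract can be sketched as follows. The field names, the `max_age` freshness window, and the use of SHA-256 as the checksum are assumptions made for illustration; the abstract specifies only that a message carries a timestamp, a data location, and a checksum.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Notification:
    timestamp: float  # when the sender issued the notification
    location: str     # where the data can be fetched
    checksum: str     # checksum of the current version of the data


def checksum(data: bytes) -> str:
    """Checksum used by sender and recipient (SHA-256 is an assumption)."""
    return hashlib.sha256(data).hexdigest()


def should_fetch(note: Notification, cache: dict[str, bytes],
                 now: float, max_age: float = 60.0) -> bool:
    """Decide whether the recipient needs to request the data itself."""
    if now - note.timestamp > max_age:
        return False  # notification is no longer timely: ignore it
    cached = cache.get(note.location)
    if cached is not None and checksum(cached) == note.checksum:
        return False  # the cache already holds the current version
    return True
```

Only when `should_fetch` returns `True` does the recipient request the data, so the bulk transfer happens on demand while the small notification is sent immediately.
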

9 Claims, 7 Drawing figures 
Exemplary Claim Number: 1 
Number of Drawing Sheets: 6
Set Name  Query                                                                   Hit Count  Set Name
side by side                                                                                 result set

DB=USPT; PLUR=YES; OP=ADJ

L12       (identify$3 or determin$3) same (modification or changes or update)     67         L12
          same (master or primary or orginal or source) near3(copy or file or
          volume ) same (replicat$3 or propogat$3 or transmit$4)

L11       (identify$3 or determin$3) same (modification or changes or update)     6          L11
          near3 (master or primary or orginal or source) near5 (file or volume )
          same (replicat$3 or propogat$3 or transmit$4)

L10       (identify$3 or determin$3) near5 (modification or changes or update)    1          L10
          near3 (master or primary or orginal or source) (file or volume) same
          (replicat$3 or propogat$3 or transmit$4)

L9        (identify$3 or determin$3) near5 (modification or changes) near3        0          L9
          (file or volume or repository) same (master or primary or orginal
          or source) same (replicat$3 or propogat$3 or transmit$4) sa

L8        5842216[uref]                                                           15         L8

L7        L6 same (web server or web)                                             1          L7

L6        (identify$3 or determin$3) near5 (modification or changes) same         404        L6
          (master or primary or orginal or source) same (replicat$3 or
          propogat$3 or transmit$4)

--        web server with (master or primary or orginal or source) same           --         --
          (replicat$3 or distribut$3 or propogat$3)

--        (5845290| 5860071| 5890152| 6038610| 6073135| 6182085| 6249795|         11         --
          6263364| 6269370| 6292894| 6295529) ![pn]

L3        web server with (master or primary or orginal or source) same           14         L3
          (replicat$3 or distribut$3 or propogat$3) same (notif$6 or informa$6
          or feedback)

L2        (web or server or internet) with (master or primary or orginal or       604        L2
          source) same (replicat$3 or distribut$3 or propogat$3) same (notif$6
          or informa$6 or feedback)

L1        6182117.pn.                                                             1          L1